Consistent Biclustering and Applications to Agriculture

نویسندگان

  • Antonio Mucherino
  • Alejandra Urtubia
چکیده

Consistent biclusterings of training sets can be exploited for solving classification problems in data mining. This technique has been mainly applied so far to solve classification problems related to gene expression data. However, it can be successfully applied to problems arising in other domains, and it is also able to provide information on the features causing the classification of the training set. We provide a quick overview of this technique, and we present a study on a particular problem arising in the agricultural field. We consider the problem of predicting problematic fermentations of wine at early stages of the vinification. The presented computational experiments show that the considered technique is able to provide some clues on the possible features causing the problematic fermentations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending the definition of beta-consistent biclustering for feature selection

Consistent biclusterings of sets of data are useful for solving feature selection and classification problems. The problem of finding a consistent biclustering can be formulated as a combinatorial optimization problem, and it can be solved by the employment of a recently proposed VNS-based heuristic. In this context, the concept of β-consistent biclustering has been introduced for dealing with ...

متن کامل

Models and Issues in Consistent Biclustering

Biclustering is a methodology allowing simultaneous partitioning of a set of samples and their features into classes. Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. The notion of consistency for biclustering is defined using interrelation between centroids of sample and feature classes. Consis...

متن کامل

Fuzzy Adaptive Resonance Theory, Diffusion Maps and their applications to Clustering and Biclustering

In this paper, we describe an algorithm FARDiff (Fuzzy Adaptive Resonance Diffusion) which combines Diffusion Maps and Fuzzy Adaptive Resonance Theory to do clustering and biclustering on high dimensional data. We describe some applications of this method.

متن کامل

Biclustering in data mining

Biclustering consists in simultaneous partitioning of the set of samples and the set of their attributes (features) into subsets (classes). Samples and features classified together are supposed to have a high relevance to each other. In this paper we review the most widely used and successful biclustering techniques and their related applications. This survey is written from a theoretical viewp...

متن کامل

A structured view on pattern mining-based biclustering

Mining matrices to find relevant biclusters, subsets of rows exhibiting a coherent pattern over a subset of columns, is a critical task for a wide-set of biomedical and social applications. Since biclustering is a challenging combinatorial optimization task, existing approaches place restrictions on the allowed structure, coherence and quality of biclusters. Biclustering approaches relying on p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010